Families and the structural relatedness among globular proteins.
نویسندگان
چکیده
Protein structures come in families. Are families "closely knit" or "loosely knit" entities? We describe a measure of relatedness among polymer conformations. Based on weighted distance maps, this measure differs from existing measures mainly in two respects: (1) it is computationally fast, and (2) it can compare any two proteins, regardless of their relative chain lengths or degree of similarity. It does not require finding relative alignments. The measure is used here to determine the dissimilarities between all 12,403 possible pairs of 158 diverse protein structures from the Brookhaven Protein Data Bank (PDB). Combined with minimal spanning trees and hierarchical clustering methods, this measure is used to define structural families. It is also useful for rapidly searching a dataset of protein structures for specific substructural motifs. By using an analogy to distributions of Euclidean distances, we find that protein families are not tightly knit entities.
منابع مشابه
The folding and design of repeat proteins: reaching a consensus.
Although they are widely distributed across kingdoms and are involved in a myriad of essential processes, until recently, repeat proteins have received little attention in comparison to globular proteins. As the name indicates, repeat proteins contain strings of tandem repeats of a basic structural element. In this respect, their construction is quite different from that of globular proteins, i...
متن کاملTarget space for structural genomics revisited
MOTIVATION Structural genomics eventually aims at determining structures for all proteins. However, in the beginning experimentalists are likely to focus on globular proteins to achieve a rapid basic coverage of protein sequence space. How many proteins will structural genomics have to target? How many proteins will be excluded since we already have structural information for these or since the...
متن کاملCatalytic domain architecture of metzincin metalloproteases.
Metalloproteases cleave proteins and peptides, and deregulation of their function leads to pathology. An understanding of their structure and mechanisms of action is necessary to the development of strategies for their regulation. Among metallopeptidases are the metzincins, which are mostly multidomain proteins with approximately 130-260-residue globular catalytic domains showing a common core ...
متن کاملAn insight into domain combinations
Domains are the building blocks of all globular proteins, and are units of compact three-dimensional structure as well as evolutionary units. There is a limited repertoire of domain families, so that these domain families are duplicated and combined in different ways to form the set of proteins in a genome. Proteins are gene products. The processes that produce new genes are duplication and rec...
متن کاملIdentifying foldable regions in protein sequence from the hydrophobic signal
Structural genomics initiatives aim to elucidate representative 3D structures for the majority of protein families over the next decade, but many obstacles must be overcome. The correct design of constructs is extremely important since many proteins will be too large or contain unstructured regions and will not be amenable to crystallization. It is therefore essential to identify regions in pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Protein science : a publication of the Protein Society
دوره 2 6 شماره
صفحات -
تاریخ انتشار 1993